Search CORE

83 research outputs found

Recommended from our members

Improving Performance of M-to-N Processing and Data Redistribution in In Transit Analysis and Visualization

Author: Bethel E Wes
Ferrier Nicola
Gu Junmin
Kress James
Logan Jeremey
Loring Burlen
Rizzi Silvio
Shudler Sergei
Wolf Matthew
Publication venue: eScholarship, University of California
Publication date: 25/05/2020
Field of study

In an in transit setting, a parallel data producer, such as a numerical simulation, runs on one set of ranks M, while a data consumer, such as a parallel visualization application, runs on a different set of ranks N. One of the central challenges in this in transit setting is to determine the mapping of data from the set of M producer ranks to the set of N consumer ranks. This is a challenging problem for several reasons, such as the producer and consumer codes potentially having different scaling characteristics and different data models. The resulting mapping from M to N ranks can have a significant impact on aggregate application performance. In this work, we present an approach for performing this M-to-N mapping in a way that has broad applicability across a diversity of data producer and consumer applications. We evaluate its design and performance with a study that runs at high concurrency on a modern HPC platform. By leveraging design characteristics, which facilitate an “intelligent” mapping from M-to-N, we observe significant performance gains are possible in terms of several different metrics, including time-to-solution and amount of data moved

eScholarship - University of California

Extending OLAP Querying to External Object

Author: Gu Junmin
Jensen Christian Søndergaard
Pedersen Torben Bach
Shoshani Arie
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2000
Field of study

VBN

Grid collector: an event catalog with automated file management

Author: Gu Junmin
Shoshani Arie
Sim Alexander
Wu Kesheng
Zhang Wei-Ming
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2003
Field of study

High Energy Nuclear Physics (HENP) experiments such as STAR at BNL and ATLAS at CERN produce large amounts of data that are stored as files on mass storage systems in computer centers. In these files, the basic unit of data is an event. Analysis is typically performed on a selected set of events. The files containing these events have to be located, copied from mass storage systems to disks before analysis, and removed when no longer needed. These file management tasks are tedious and time consuming. Typically, all events contained in the files are read into memory before a selection is made. Since the time to read the events dominate the overall execution time, reading the unwanted event needlessly increases the analysis time. The Grid Collector is a set of software modules that works together to address these two issues. It automates the file management tasks and provides ''direct'' access to the selected events for analyses. It is currently integrated with the STAR analysis framework. The users can select events based on tags, such as, ''production date between March 10 and 20, and the number of charged tracks > 100.'' The Grid Collector locates the files containing relevant events, transfers the files across the Grid if necessary, and delivers the events to the analysis code through the familiar iterators. There has been some research efforts to address the file management issues, the Grid Collector is unique in that it addresses the event access issue together with the file management issues. This makes it more useful to a large variety of users

Crossref

eScholarship - University of California

UNT Digital Library

Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting

Author: Chang C. S.
Choi Jong
Churchill R. Michael
Gu Junmin
Klasky Scott
Ku Seung-Hoe
Lin Paul
Podhorszki Norbert
Wu Kesheng
Publication venue
Publication date: 02/11/2023
Field of study

This work starts an in situ processing capability to study a certain diffusion process in magnetic confinement fusion. This diffusion process involves plasma particles that are likely to escape confinement. Such particles carry a significant amount of energy from the burning plasma inside the tokamak to the diverter and damaging the diverter plate. This study requires in situ processing because of the fast changing nature of the particle diffusion process. However, the in situ processing approach is challenging because the amount of data to be retained for the diffusion calculations increases over time, unlike in other in situ processing cases where the amount of data to be processed is constant over time. Here we report our preliminary efforts to control the memory usage while ensuring the necessary analysis tasks are completed in a timely manner. Compared with an earlier naive attempt to directly computing the same diffusion displacements in the simulation code, this in situ version reduces the memory usage from particle information by nearly 60% and computation time by about 20%

arXiv.org e-Print Archive

A chromosomelevel genome assembly of the Asian arowana, Scleropages formosus

Author: Bian Chao
Gu Ruobo
Hu Yinchang
Huang Yu
Kuznetsova Inna S.
Li Jia
Mu Xidong
Orban Laszlo
Qui Ying
Ravi Vydianathan
Shen Xueyan
Shi Qiong
Sun Ying
Venkatesh Byrappa
Xinxin You
Xu Junmin
Xu Pao
Yu Hui
Zhang Xinhui
Publication venue: Nature Publishing Group
Publication date: 01/01/2016
Field of study

Asian arowana (Scleropages formosus), an ancient teleost belonging to the Order Osteoglossomorpha, has been a valuable ornamental fish with some varieties. However, its biological studies and breeding germplasm have been remarkably limited by the lack of a reference genome. To solve these problems, here we report high-quality genome sequences of three common varieties of Asian arowana (the golden, red and green arowana). We firstly generated a chromosome-level genome assembly of the golden arowana, on basis of the genetic linkage map constructed with the restriction site-associated DNA sequencing (RAD-seq). In addition, we obtained draft genome assemblies of the red and green varieties. Finally, we annotated 22,016, 21,256 and 21,524 protein-coding genes in the genome assemblies of golden, red and green varieties respectively. Our data were deposited in publicly accessible repositories to promote biological research and molecular breeding of Asian arowana

ZENODO

ResearchOnline at James Cook University

PubMed Central

Electronic Archiving System

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

ScholarBank@NUS

Storage Resource Manager version 2.2: design, implementation, and testing experience

Author: Abadie Lanna
Badino Paolo
Baud J P
Corso Ezio
De Witt Shaun
Donno Flavia
Fuhrmann Patrick
Gu Junmin
Koblitz Birger
Lemaitre Sophie
Litmaath Maarten
Litvintsev Dimitry
Lo Presti Giuseppe
Magnoni Luca
McCance Gavin
Mkrtchan Tigran
Mollon Rémi
Natarajan Vijaya
Perelmutov Timur
Petravick Don
Shoshani Arie
Sim Alex
Smith David
Tedesco Paolo
Zappi Riccardo
Publication venue: 'IOP Publishing'
Publication date: 31/10/2007
Field of study

Storage Services are crucial components of the Worldwide LHC Computing Grid Infrastructure spanning more than 200 sites and serving computing and storage resources to the High Energy Physics LHC communities. Up to tens of Petabytes of data are collected every year by the four LHC experiments at CERN. To process these large data volumes it is important to establish a protocol and a very efficient interface to the various storage solutions adopted by the WLCG sites. In this work we report on the experience acquired during the definition of the Storage Resource Manager v2.2 protocol. In particular, we focus on the study performed to enhance the interface and make it suitable for use by the WLCG communities. At the moment 5 different storage solutions implement the SRM v2.2 interface: BeStMan (LBNL), CASTOR (CERN and RAL), dCache (DESY and FNAL), DPM (CERN), and StoRM (INFN and ICTP). After a detailed inside review of the protocol, various test suites have been written identifying the most effective set of tests: the S2 test suite from CERN and the SRM-Tester test suite from LBNL. Such test suites have helped verifying the consistency and coherence of the proposed protocol and validating existing implementations. We conclude our work describing the results achieved

CERN Document Server

The Asian arowana (<i>Scleropages formosus</i>) genome provides new insights into the evolution of an early lineage of teleosts

Author: Bian Chao
Chen Jieming
Fan Guangyi
Fan Mingjun
Gu Dangen
Hu Guojun
Hu Yinchang
Huang Yu
Kabilov Marsel
Komissarov Aleksey S.
Kuznetsova Inna S.
Li Jia
Li Wujiao
Li Xiaofeng
Liu Chao
Liu Yi
Luo Jianren
Mu Xidong
O'Brien Stephen J.
Orbán László
Peng Chao
Polgar Gianluca
Qiu Ying
Ravi Vydianathan
Ruan Zhiqiang
Shen Xueyan
Shi Qiong
Song Hongmei
Song Linsheng
Sun Ying
Tang Zhujing
Tay Boon Hui
Thevasagayam Natascha May
Trifonov Vladimir
Tupikin Alexey
Venkatesh Byrappa
Wang Jian
Wang Jun
Wang Min
Wang Xuejie
Xiong Zijun
Xu Gangchun
Xu Junmin
Xu Pao
Xu Xun
Yang Huanming
Yang Yexin
You Xinxin
Yu Hui
Zeng Peng
Zhang He
Zhao Xiaomeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

The Asian arowana (Scleropages formosus), one of the world’s most expensive cultivated ornamental fishes, is an endangered species. It represents an ancient lineage of teleosts: the Osteoglossomorpha. Here, we provide a high-quality chromosome-level reference genome of a female golden-variety arowana using a combination of deep shotgun sequencing and high-resolution linkage mapping. In addition, we have also generated two draft genome assemblies for the red and green varieties. Phylogenomic analysis supports a sister group relationship between Osteoglossomorpha (bonytongues) and Elopomorpha (eels and relatives), with the two clades together forming a sister group of Clupeocephala which includes all the remaining teleosts. The arowana genome retains the full complement of eight Hox clusters unlike the African butterfly fish (Pantodon buchholzi), another bonytongue fish, which possess only five Hox clusters. Differential gene expression among three varieties provides insights into the genetic basis of colour variation. A potential heterogametic sex chromosome is identified in the female arowana karyotype, suggesting that the sex is determined by a ZW/ZZ sex chromosomal system. The high-quality reference genome of the golden arowana and the draft assemblies of the red and green varieties are valuable resources for understanding the biology, adaptation and behaviour of Asian arowanas

Copenhagen University Research Information System

PubMed Central

Research Repository

NSU Works